977 resultados para Base Composition


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool `PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: While effective population size (Ne) and life history traits such as generation time are known to impact substitution rates, their potential effects on base composition evolution are less well understood. GC content increases with decreasing body mass in mammals, consistent with recombination-associated GC biased gene conversion (gBGC) more strongly impacting these lineages. However, shifts in chromosomal architecture and recombination landscapes between species may complicate the interpretation of these results. In birds, interchromosomal rearrangements are rare and the recombination landscape is conserved, suggesting that this group is well suited to assess the impact of life history on base composition. RESULTS: Employing data from 45 newly and 3 previously sequenced avian genomes covering a broad range of taxa, we found that lineages with large populations and short generations exhibit higher GC content. The effect extends to both coding and non-coding sites, indicating that it is not due to selection on codon usage. Consistent with recombination driving base composition, GC content and heterogeneity were positively correlated with the rate of recombination. Moreover, we observed ongoing increases in GC in the majority of lineages. CONCLUSIONS: Our results provide evidence that gBGC may drive patterns of nucleotide composition in avian genomes and are consistent with more effective gBGC in large populations and a greater number of meioses per unit time; that is, a shorter generation time. Thus, in accord with theoretical predictions, base composition evolution is substantially modulated by species life history.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Theoretical and empirical studies were conducted on the pattern of nucleotide and amino acid substitution in evolution, taking into account the effects of mutation at the nucleotide level and purifying selection at the amino acid level. A theoretical model for predicting the evolutionary change in electrophoretic mobility of a protein was also developed by using information on the pattern of amino acid substitution. The specific problems studied and the main results obtained are as follows: (1) Estimation of the pattern of nucleotide substitution in DNA nuclear genomes. The pattern of point mutations and nucleotide substitutions among the four different nucleotides are inferred from the evolutionary changes of pseudogenes and functional genes, respectively. Both patterns are non-random, the rate of change varying considerably with nucleotide pair, and that in both cases transitions occur somewhat more frequently than transversions. In protein evolution, substitution occurs more often between amino acids with similar physico-chemical properties than between dissimilar amino acids. (2) Estimation of the pattern of nucleotide substitution in RNA genomes. The majority of mutations in retroviruses accumulate at the reverse transcription stage. Selection at the amino acid level is very weak, and almost non-existent between synonymous codons. The pattern of mutation is very different from that in DNA genomes. Nevertheless, the pattern of purifying selection at the amino acid level is similar to that in DNA genomes, although selection intensity is much weaker. (3) Evaluation of the determinants of molecular evolutionary rates in protein-coding genes. Based on rates of nucleotide substitution for mammalian genes, the rate of amino acid substitution of a protein is determined by its amino acid composition. The content of glycine is shown to correlate strongly and negatively with the rate of substitution. Empirical formulae, called indices of mutability, are developed in order to predict the rate of molecular evolution of a protein from data on its amino acid sequence. (4) Studies on the evolutionary patterns of electrophoretic mobility of proteins. A theoretical model was constructed that predicts the electric charge of a protein at any given pH and its isoelectric point from data on its primary and quaternary structures. Using this model, the evolutionary change in electrophoretic mobilities of different proteins and the expected amount of electrophoretically hidden genetic variation were studied. In the absence of selection for the pI value, proteins will on the average evolve toward a mildly basic pI. (Abstract shortened with permission of author.) ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The correspondence between the transversion/transition ratio and the neighboring base composition in chloroplast DNA is examined. For 18 noncoding regions of the chloroplast genome, alignments between rice (Oryza sativa) and maize (Zea mays) were generated by two different methods. Difficulties of aligning noncoding DNA are discussed, and the alignments are analyzed in a manner that reduces alignment artifacts. Sequence divergence is < 10%, so multiple substitutions at a site are assumed to be rare. Observed substitutions were analyzed with respect to the A+T content of the two immediately flanking bases. It is shown that as this content increases, the proportion of transversions also increases. When both the 5'- and 3'-flanking nucleotides are G or C (A+T content of 0), only 25% of the observed substitutions are transversions. However, when both the 5'- and 3'-flanking nucleotides are A or T (A+T content of 2), 57% of the observed substitutions are transversions. Therefore, the influence of flanking base composition on substitutions, previously reported for a single noncoding region, is a general feature of the chloroplast genome.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The base composition pattern (BCP) in the putative promoter region (PPRs) up to 5 Kb lengths of 682 human genes on Chromosome 22 (Chr22) was examined. Two-dimensional (2D) and three-dimensional (3D) functions were designed to delineate the DNA base composition, with four major patterns identified. It is found that 17.6% genes include TATA box, 28.0% GC box, 18.9% CAAT box and 38.4% CpG islands, and approximately 10% genes have one of four putative initiator (Inr) motifs. The occurrence of the promoter elements is tightly associated with the base composition features in the promoter regions, and the associations of the base composition features with occurrence of the promoter elements in the promoter regions mediate tissue-wide expression of the genes in human. The occurrence of two or more promoter elements in the promoter regions is required for the medium- and wide-range expression profiles of the human genes on Chr22. Thus, the reported data shed light on the characteristics of the PPRs of the human genes on Chr22, which may improve our understanding of regulatory roles of the PPRs with occurrence of the promoter elements in gene expression.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Dihalomethanes can produce liver tumors in mice but not in rats, and concern exists about the risk of these compounds to humans. Glutathione (GSH) conjugation of dihalomethanes has been considered to be a critical event in the bioactivation process, and risk assessment is based upon this premise; however, there is little experimental support for this view or information about the basis of genotoxicity. A plasmid vector containing rat GSH S-transferase 5-5 was transfected into the Salmonella typhimurium tester strain TA1535, which then produced active enzyme. The transfected bacteria produced base-pair revertants in the presence of ethylene dihalides or dihalomethanes, in the order CH2Br2 > CH2BrCl > CH2Cl2. However, revertants were not seen when cells were exposed to GSH, CH2Br2, and an amount of purified GSH S-transferase 5-5 (20-fold excess in amount of that expressed within the cells). HCHO, which is an end product of the reaction of GSH with dihalomethanes, also did not produce mutations. S-(1-Acetoxymethyl)GSH was prepared as an analog of the putative S-(1-halomethyl)GSH reactive intermediates. This analog did not produce revertants, consistent with the view that activation of dihalomethanes must occur within the bacteria to cause genetic damage, presenting a model to be considered in studies with mammalian cells. S-(1-Acetoxymethyl)GSH reacted with 2′-deoxyguanosine to yield a major adduct, identified as S-[1-(N2-deoxyguanosinyl)methyl]GSH. Demonstration of the activation of dihalomethanes by this mammalian GSH S-transferase theta class enzyme should be of use in evaluating the risk of these chemicals, particularly in light of reports of the polymorphic expression of a similar activity in humans.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

A new method for computing evolutionary distances between DNA sequences is proposed. Contrasting with classical methods, the underlying model does not assume that sequence base compositions (A, C, G, and T contents) are at equilibrium, thus allowing unequal base compositions among compared sequences. This makes the method more efficient than the usual ones in recovering phylogenetic trees from sequence data when base composition is heterogeneous within the data set, as we show by using both simulated and empirical data. When applied to small-subunit ribosomal RNA sequences from several prokaryotic or eukaryotic organisms, this method provides evidence for an early divergence of the microsporidian Vairimorpha necatrix in the eukaryotic lineage.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Australasian marsupials include three major radiations, the insectivorous/carnivorous Dasyuromorphia, the omnivorous bandicoots (Peramelemorphia), and the largely herbivorous diprotodontians. Morphologists have generally considered the bandicoots and diprotodontians to be closely related, most prominently because they are both syndactylous (with the 2nd and 3rd pedal digits being fused). Molecular studies have been unable to confirm or reject this Syndactyla hypothesis. Here we present new mitochondrial (mt) genomes from a spiny bandicoot (Echymipera rufescens) and two dasyurids, a fat-tailed dunnart (Sminthopsis crassicaudata) and a northern quoll (Dasyurus hallucatus). By comparing trees derived from pairwise base-frequency differences between taxa with standard (absolute, uncorrected) distance trees, we infer that composition bias among mt protein-coding and RNA sequences is sufficient to mislead tree reconstruction. This can explain incongruence between trees obtained from mt and nuclear data sets. However, after excluding major sources of compositional heterogeneity, both the “reduced-bias” mt and nuclear data sets clearly favor a bandicoot plus dasyuromorphian association, as well as a grouping of kangaroos and possums (Phalangeriformes) among diprotodontians. Notably, alternatives to these groupings could only be confidently rejected by combining the mt and nuclear data. Elsewhere on the tree, Dromiciops appears to be sister to the monophyletic Australasian marsupials, whereas the placement of the marsupial mole (Notoryctes) remains problematic. More generally, we contend that it is desirable to combine mt genome and nuclear sequences for inferring vertebrate phylogeny, but as separately modeled process partitions. This strategy depends on detecting and excluding (or accounting for) major sources of nonhistorical signal, such as from compositional nonstationarity.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Ratites are large, flightless birds and include the ostrich, rheas, kiwi, emu, and cassowaries, along with extinct members, such as moa and elephant birds. Previous phylogenetic analyses of complete mitochondrial genome sequences have reinforced the traditional belief that ratites are monophyletic and tinamous are their sister group. However, in these studies ratite monophyly was enforced in the analyses that modeled rate heterogeneity among variable sites. Relaxing this topological constraint results in strong support for the tinamous (which fly) nesting within ratites. Furthermore, upon reducing base compositional bias and partitioning models of sequence evolution among protein codon positions and RNA structures, the tinamou–moa clade grouped with kiwi, emu, and cassowaries to the exclusion of the successively more divergent rheas and ostrich. These relationships are consistent with recent results from a large nuclear data set, whereas our strongly supported finding of a tinamou–moa grouping further resolves palaeognath phylogeny. We infer flight to have been lost among ratites multiple times in temporally close association with the Cretaceous–Tertiary extinction event. This circumvents requirements for transient microcontinents and island chains to explain discordance between ratite phylogeny and patterns of continental breakup. Ostriches may have dispersed to Africa from Eurasia, putting in question the status of ratites as an iconic Gondwanan relict taxon. [Base composition; flightless; Gondwana; mitochondrial genome; Palaeognathae; phylogeny; ratites.]

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Complementary DNAs covering the entire RNA genome of soybean dwarf luteovirus (SDV) were cloned and sequenced. Computer analysis of the 5861 nucleotide sequence revealed five major open reading frames (ORFs) possessing conservation of sequence and organisation with known luteovirus sequences. Comparative analyses of the genome structure show that SDV shares sequence homology and features of gene organisation with barley yellow dwarf virus (PAV isolate) in the 5' half of the genome, yet is more closely related to potato leafroll virus in its 3' coding regions. In addition, SDV differs from other known luteoviruses in possessing an exceptionally long 3' terminal sequence with no apparent coding capacity. We conclude from these data that the SDV genome represents a third variant genome type in the luteovirus group.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Coleoptera is the most diverse group of insects with over 360,000 described species divided into four suborders: Adephaga, Archostemata, Myxophaga, and Polyphaga. In this study, we present six new complete mitochondrial genome (mtgenome) descriptions, including a representative of each suborder, and analyze the evolution of mtgenomes from a comparative framework using all available coleopteran mtgenomes. We propose a modification of atypical cox1 start codons based on sequence alignment to better reflect the conservation observed across species as well as findings of TTG start codons in other genes. We also analyze tRNA-Ser(AGN) anticodons, usually GCU in arthropods, and report a conserved UCU anticodon as a possible synapomorphy across Polyphaga. We further analyze the secondary structure of tRNA-Ser(AGN) and present a consensus structure and an updated covariance model that allows tRNAscan-SE (via the COVE software package) to locate and fold these atypical tRNAs with much greater consistency. We also report secondary structure predictions for both rRNA genes based on conserved stems. All six species of beetle have the same gene order as the ancestral insect. We report noncoding DNA regions, including a small gap region of about 20 bp between tRNA-Ser(UCN) and nad1 that is present in all six genomes, and present results of a base composition analysis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Analysis of ribosomes and the post ribosomal supernatant fraction of actively growing cells ofThermomyces lanuginosus showed the presence of free 5 S RNA in the supernatant fraction. This 5 S RNA was identical to the ribosomal 5 S RNA in its electrophoretic mobility on 10% Polyacrylamide gel and in its base composition. 5 S RNA from both the sources gave evidence for the presence of diphosphate at the 5’ end. Most of the 5 S RNA that appeared in the cytoplasm was that transported from the nucleus during the isolation. This could be prevented by the use of a hexylene glycol-HEPES buffer.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The minor base composition of Mycobacterium smegmatis tRNA has been studied. Thin-layer chromatographic patterns of a ribonuclease T2 digest of mycobacterial tRNA indicated the presence of appreciable amounts of 1-methyladenosine (which is commonly present only in eucaryotic tRNA), dihydrouridine, and 7-methylguanosine. Ribothymidine was absent. The S-adenosylmethionine-dependent tRNA methylases of M. smegmatis catalyzed the formation of 1-methyladenosine when Escherichia coli tRNA was used as acceptor. Similarly, E. coli extracts methylated the tRNA of M. smegmatis, forming ribothymidine.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Quinoxaline antibiotics (Fig. 1a, b) form a useful group of compounds for the study of drug–nucleic acid interactions1,2. They consist of a cross-bridged cyclic octadepsipeptide, variously modified, bearing two quinoxaline chromophores. These antibiotics intercalate bifunctionally into DNA2,3 probably via the narrow groove, forming a complex in which, most probably, two base pairs are sandwiched between the chromophores4,5. Depending on the nature of their sulphur-containing cross-bridge and modifications to their amino acid side chains, they display characteristic patterns of nucleotide sequence selectivity when binding to DNAs of different base composition and to synthetic polydeoxynucleotides4,6,7. This specificity has been tentatively ascribed to specific hydrogen-bonding interactions between functional groups in the DNA and complementary moieties on the peptide ring2,4,5. Variations in selectivity have been attributed both to changes in the conformation of the peptide backbone6 and no modifications of the cross-bridge7. These suggestions were made, however, in the absence of firm knowledge about the three-dimensional structure and conformation of the antibiotic molecules. We now report the X-ray structure analysis of the synthetic analogue of the antibiotic triostin A, TANDEM (des-N-tetramethyl triostin A) (Fig. 1c), which binds preferentially to alternating adenine-thymine sequences7. The X-ray structure provides a starting point for exploring the origin of this specificity and suggests possible models for the binding of other members of the quinoxaline series.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Atomistic molecular dynamics simulations have been carried out to reveal the characteristic features of ethylenediamine (EDA) cored protonated (corresponding to neutral pH) poly amido amine (PAMAM) dendrimers of generation 3 (G3) and 4 (G4) that are functionalized with single strand DNAs (ssDNAs). The four ssDNA strands that are attached via an alkythiolate [-S(CH(2))(6)-] linker molecule to the free amine groups on the surface of the PAMAM dendrimers are observed to undergo a rapid conformational change during the 25 ns long simulation period. From the RMSD values of ssDNAs, we find relative stability in the case of purine rich (having more adenine and guanine) ssDNA strands than pyrimidine rich (thymine and cytosine) ssDNA strands. The degree of wrapping of ssDNA strands on the dendrimer molecule was found to be influenced by the charge ratio of DNA and the dendrimer. As the G4 dendrimer contains relatively more positive charge than G3 dendrimer, we observe extensive wrapping of ssDNAs on the G4 dendrimer than G3 dendrimer. This might indicate that DNA functionalized G3 dendrimer is more suitable to construct higher order nanostructures. The linker molecule was also found to undergo drastic conformational change during the simulation. During nanosecond long simulation some portion of the linker molecule was found to be lying nearly flat on the surface of the dendrimer molecule. The ssDNA strands along with the linkers are seen to penetrate the surface of the dendrimer molecule and approach closer to the center of the dendrimer indicating the soft sphere nature of the dendrimer molecule. The effective radius of DNA-functionalized dendrimer nanoparticles was found to be independent of base composition of ssDNAs and was observed to be around 19.5 angstrom and 22.4 angstrom when we used G3 and G4 PAMAM dendrimers as the core of the nanoparticle respectively. The observed effective radius of DNA-functionalized dendrimer molecules apparently indicates the significant shrinkage in the structure that has taken place in dendrimer, linker and DNA strands. As a whole our results describe the characteristic features of DNA-functionalized dendrimer nanoparticles and can be used as strong inputs to design effectively the DNA-dendrimer nanoparticle self-assembly for their active biological applications.